Comparative Study of Differentially Private Synthetic Data Algorithms from the NIST PSCR Differential Privacy Synthetic Data Challenge
Abstract
Differentially private synthetic data generation offers a recent solution to release analytically useful data while preserving the privacy of individuals in the data. In order to utilize these algorithms for public policy decisions, policymakers need an accurate understanding of these algorithms' comparative performance. Correspondingly, data practitioners also require standard metrics for evaluating the analytic qualities of the synthetic data. In this paper, we present an in-depth evaluation of several differentially private synthetic data algorithms using actual differentially private synthetic data sets created by contestants in the National Institute of Standards and Technology Public Safety Communications Research (NIST PSCR) Division's "Differential Privacy Synthetic Data Challenge." We offer analyses of these algorithms based on both the accuracy of the data they create and their usability by potential data providers. We frame the methods used in the NIST PSCR data challenge within the broader differentially private synthetic data literature. We implement additional utility metrics, including two of our own, on the differentially private synthetic data and compare mechanism utility on three categories. Our comparative assessment of the differentially private data synthesis methods and the quality metrics shows the relative usefulness, general strengths and weaknesses, and preferred choices of algorithms and metrics. Finally, we describe the implications of our evaluation for policymakers seeking to implement differentially private synthetic data algorithms on future data products.
Related resources
DPSynthesizer: Differentially Private Data Synthesizer for Privacy Preserving Data Sharing
Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Releasing synthetic data that mimic original data with differential privacy provides a promising way for privacy-preserving data sharing and analytics while providing a rigorous privacy guarantee. However, to date there are no open-source tools that allow users to genera...
Differentially Private Verification of Predictions from Synthetic Data
Differentially Private Verification of Predictions from Synthetic Data, by Haoyang Yu, Program in Statistical and Economic Modeling, Duke University
A Comparative Study of Some Clustering Algorithms on Shape Data
Recently, some statistical studies have been done using the shape data. One of these studies is clustering shape data, which is the main topic of this paper. We are going to study some clustering algorithms on shape data and then introduce the best algorithm based on accuracy, speed, and scalability criteria. In addition, we propose a method for representing the shape data that facilitates and ...
Gradually Releasing Private Data under Differential Privacy
Aggregating individuals’ data and computing statistics over a population are key ingredients to enable the Internet of Things [1]. Constructing traffic maps from individuals’ GPS traces [2] and performing demand response in smart grids [3], [4] are two examples that involve such data aggregation. Using these statistics, individuals can perform their activities more efficiently; they may choose ...
A Study on Insurer Solvency by Panel Data Model: The Case of Iranian Insurance Market
The aim of this thesis is to develop an approach for assessing the solvency of Iranian insurance companies. We use economic data with both time-series and cross-sectional variation, and therefore apply a panel data model to examine insurer solvency.
Journal
Journal title: Journal of Privacy and Confidentiality
Year: 2021
ISSN: 2575-8527
DOI: https://doi.org/10.29012/jpc.748